Neural Sequence Prediction by Coaching

نویسندگان

  • Wenhu Chen
  • Guanlin Li
  • Shujie Liu
  • Zhirui Zhang
  • Mu Li
  • Ming Zhou
چکیده

Maximum Likelihood Estimation (MLE) suffers from data sparsity problem in sequence prediction tasks where training resource is rare. In order to alleviate this problem, in this paper, we propose a novel generative bridging network (GBN) to train sequence prediction models, which contains a generator and a bridge. Unlike MLE directly maximizing the likelihood of the ground truth, the bridge extends the point-wise ground truth to a bridge distribution (containing inexhaustible examples), and the generator is trained to minimize their KL-divergence. In order to guide the training of generator with additional signals, the bridge distribution can be set or trained to possess specific properties, by using different constraints. More specifically, to increase output diversity, enhance language smoothness and relieve learning burden, three different regularization constraints are introduced to construct bridge distributions. By combining these bridges with a sequence generator, three independent GBNs are proposed, namely uniform GBN, language-model GBN and coaching GBN. Experiment conducted on two recognized sequence prediction tasks (machine translation and abstractive text summarization) shows that our proposed GBNs can yield significant improvements over strong baseline systems. Furthermore, by analyzing samples drawn from bridge distributions, expected influences on the sequence model training are verified.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

Stream Flow Prediction in Flood Plain by Using Artificial Neural Network (Case Study: Sepidroud Watershed)

In order to determine hydrological behavior and water management of Sepidroud River (North of Iran-Guilan) the present study has focused on stream flow prediction by using artificial neural network. Ten years observed inflow data (2000-2009) of Sepidroud River were selected; then these data have been forecasted by using neural network. Finally, predicted results are compared to the observed dat...

متن کامل

Prediction of methanol loss by hydrocarbon gas phase in hydrate inhibition unit by back propagation neural networks

Gas hydrate often occurs in natural gas pipelines and process equipment at high pressure and low temperature. Methanol as a hydrate inhibitor injects to the potential hydrate systems and then recovers from the gas phase and re-injects to the system. Since methanol loss imposes an extra cost on the gas processing plants, designing a process for its reduction is necessary. In this study, an accur...

متن کامل

Prediction of Bending Angle for Laser Forming of Tailor Machined Blanks by Neural Network

Tailor-made blanks are sheet metal assemblies with different thicknesses and/or materials and/or surface coatings. A monolithic sheet can be machined to make the required thickness variations that is referred as tailor machined blanks. Due to the thickness variation in tailor machined blanks, laser bending of these blanks is more complicated than monolithic plates. In this article, laser formin...

متن کامل

Traffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization

Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1706.09152  شماره 

صفحات  -

تاریخ انتشار 2017